 marten 2014


90fd4f88f588ae64038134f1eeaa023f-AuthorFeedback.pdf

Neural Information Processing Systems

Thank you for all the helpful comments. The reviewers raised several related works, which we discuss here. We note that the authors have marked their arXiv submission as containing errors. Each of their inner loops uses SGD to solve the distance-regularized objectives. First, we use the exponential moving average (EMA) of slow weights to adjust the training parameters during optimization.